| Name | Version | Summary | Date |
|---|---|---|---|
| rdatacompy | 0.1.9 | Lightning-fast dataframe comparison library built in Rust with Python bindings | 2025-10-27 15:32:18 |
| cleaning-agent | 0.1.0 | Intelligent data cleaning agent for automated data quality improvement | 2025-10-15 07:49:25 |
| dql-core | 0.5.2 | Framework-agnostic validation engine for Data Quality Language (DQL) | 2025-10-10 18:28:21 |
| dql-parser | 0.5.2 | Pure Python parser for Data Quality Language (DQL) | 2025-10-10 18:13:39 |
| autocsv-profiler | 2.0.0 | Automated CSV data analysis with statistical profiling and visualization | 2025-10-09 11:50:39 |
| databeak | 0.1.2 | DataBeak: MCP server for comprehensive CSV file operations with pandas-based tools | 2025-10-07 13:20:43 |
| lakehouse-engine | 1.27.1 | A configuration-driven Spark framework serving as the engine for several lakehouse algorithms and data flows. | 2025-10-07 08:22:13 |
| parxyval | 0.1.0 | An evaluation framework for document parsing. | 2025-10-06 10:46:31 |
| syndat | 0.13.3 | A library for evaluation & visualization of synthetic data. | 2025-09-08 11:52:57 |
| validador-cnpj | 0.2.1 | PySpark UDFs for cleaning, repairing, normalizing, and validating CNPJ (numeric and alphanumeric). | 2025-09-04 19:30:34 |
| datacompose | 0.2.6.1 | Copy-pasteable data transformation primitives for PySpark. Inspired by shadcn-svelte. | 2025-08-25 16:54:23 |
| cleanengine | 0.1.2 | The Ultimate Data Cleaning & Analysis Toolkit | 2025-08-24 13:20:31 |
| csv-mcp-server | 1.0.0 | MCP server for comprehensive CSV file operations with pandas-based tools | 2025-08-13 06:53:17 |
| sparkdq | 0.11.0 | A declarative PySpark framework for row- and aggregate-level data quality validation. | 2025-08-09 16:03:40 |
| data-degradation-detector | 1.0.5 | A part of my TFM/Research project handles data drift | 2025-07-21 05:45:24 |
| lawkit-python | 2.5.15 | Python wrapper for lawkit - Statistical law analysis toolkit for fraud detection and data quality assessment | 2025-07-16 16:47:28 |
| diqu | 0.2.0 | Data Quality CLI for the Auto-Alerts | 2024-07-08 03:56:04 |
| diqu-email | 1.0.0 | Data Quality CLI for the Auto-Alerts - Emails | 2024-07-07 04:06:59 |
| pydeequ | 1.3.0 | PyDeequ - Unit Tests for Data | 2024-04-26 20:35:24 |
| compars | 0.0.0 | DataFrame comparison done right (AKA the Bear-agnostic DataFrame comparison library) | 2024-04-20 18:28:36 |